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ABSTRACT 

This paper reports the consistency of phonic 
generalization .employing two kinds of frequency determined from a 
corpus of the 18,000 m^st frequently occurring , words in the English 
language. Twenty-two commonly taught phonic generalizations were 
analyzed using a computer. It was concluded that consistency alone is 
not a sufficient criterion on which to judge the utility of a rule. 
The authors found that when the rule was a simple one-to-one 
corresponrdence (e.g., "ch" is always pronounced, as in "chair"), then 
direct rule learning^was most effective. However> when the rule was 
more complex (simulation of . the final "e" rule), then direct rule 
learning was not as effet:tive as a more inductive approach. This . 
suggests that teaching rules directly is only advantageous over the 
other approaches when the rule is a simple one-to-one correspondence. 
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-5f Several authors (Clymer, 1963; Fry, 1964j Oaks. 1952-, Enans, 1966- ■ 

r-i Burmeister, 1966) have studied the consistency of various phonics rules 

^ in English that iniqht be useful in teachinn children to read. In qeneral , 

it has-been assumed that if a rule is highly consistent, then it will be 
rH a usreful one to teach. Consistency in the past has been determined by 

Q computing the percentage of times a rule is found to occur in a relatively 

small corpus of words used by beginning readers. 

In addition to consistency, rule complexity rndy be an important 
♦variable that has been iqirored by reading researchers. If one were to 
translate English into another language by'writind it, then very complex 
rules might be useful — since th« trar^slator would have enough time to 
apply the rules. However, reading is a very rapid process, even for 
beginners-, so that the time taken to apply any but the simplest rules 
would make their utility suspect. Even the relatively, simple soft c - 
hard c" rule reouires considerable processing time. In its sinplest 
form, "It is "c followed by e or i sounds like s, otheVwise c pro- 
nounced k". While it is impossiFle to determine the amount of time it 
takes for this processing, it is likely to be longer than even the 
beqinnina reader can afford. • 

This paper renorts the consistency of phonic qeneral izations employing 
two kinds of frequency determined from a corpus of the 18,000 inost frequently 
- occurring' words in the English language. These r^esults are related to 

• the results of a simulated reading study on rule complexity which has,...,. 

been previously reported (Caldwell, Peckham, & Nix, 197iah 

. As part of a larger study (Caldwell, Peckham, t Nii» 1972), a 
corpus of the most frequently occurrinn 18,000 words in the English 
language (Kucera 6 Francis, 1967) v/as analyzed to detenine the frequency 
of occurrence of the b^grans, triqrans, tetragrams. peWtagrams ; hexaqrams, 
and heptagrams (hereafter referred to as ngrams) . All I 
words in Which they occur v/ere part of the output. Thet 
words in running text was also included. Thus, for an.^ 
of frequency are available^ 1) the number of words ir^ which the nqram 
appears out of the 18,000, and 2) ' the frequency oT the ngram in approxi- 
mately a million words of running text. In addition, the^various £ro- 
nun ciations of each ngram and their two types of frequencife^ are available 
by human inspection of the outplit (for a more complete d^scripti^m, see ; 
Caldwell, Peckham. I Nix, 197^b). \ . i 
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Procedures 



Tventy-two cofrnonly tauaHt phonic qeneralizations were analyzed 
usi'nn the conputer output described above- For example, to determine the 
utility of the rale concerning pronunciation of hard £9 soft all the 
words In the corpus titat contain the biqran ce or ci vtere examined to 
determine the utility of the rule concerhing pronunciation and the 
frequency of those words in runninn text were calculated and converted to 
percentaqes. ' ' 



Results and Conclusions 

Table 1 gives the results of this analusis in addition to results 
of similar analyses employ inq more'linited corpii. In general, the consis- 
tencies of the rules parallel fairly, closely those studies usinn a much 
more limited corpus. The greatest discrenencies are Withr the third and 
fourth vov/el qeneralizations (see Table 1). The third general ii:ati on 
(a following w) is much more consistent than previous counts indicate v^'hen 
tFe frequency in rurining text is considered. (In terms. of number or words 3 
it is, less consistent than In previous studies). Th€> word ms ^ v/hich 
has a yery high frequency, largly accounts for this. The fourth vov/el 
qeneralization (final e rule) is considerable less consistent than past 
studies have indicated - less than 40% in running text. Vov/el qeneralization 
five (ar followed by final e) would be lOOSK consistent if are , the single 
exception '(frequency 4393), were taught separately. 

Vov/el generalization six {tv/o vov/el rule) 1s correct less than 30% 
of the time In runninq text^ Its value Is neqlioable since rule application 
is actually detrimentcil . However, soipe of the sub-elements of the rule may 
•well be very consistent in their pronunciations. For example, au is 
pronounced as in sauce in 83% of the total frequencies. The qeneralization 
v/ould he a useful on3 to teach as such, especially since 15 of the remaininq 
17% of the cases are forms of lauc|h or beauty . If these -exceptions v/ere 
taught as sinht words, one i/oulcj have accounted for 90% of the frequency 
of occurrence by teacfiin'q three mapp.lnqs of the bigram-au. In addition, 
many sub- elements become useful simply by revising the nTle (the first letter 
is short/silent and the second is long). 

As suggested earl ier^ consistency alone is not 9 sufficient criterion 
on i^hich to judge the utility of a rule. If the rule is sufficiently complex 
there are no exceptions. At the simplest level , the rule might merely list . 
all of the exceptions. Caldv/el>, Peckftaro, and Nix (1973a) found evidence^ 
to support the contention that teachinq complex rules is not useful in 
the decoding task. In a simulated reading task, the authors found that 
when the rule was a simple one-to-one correspondence (ie. ch is always 
pronounced as in chair) ^ then direct yule learnTug v/as most effective. 
However, v^en tfte rule woa ir^orc complex (simulation of the final 0 rulq*), 
then direct rule learninq vras not as effective as a more inductive approach. 
This suggests that teaching rules dflrectly is onV advantageous over the 
other approaches when the rule is a simple one-to-one correspondence. 



/ Table 2 is a classif feat ion of the twenty-two generalizations 
accpnjing to the variables cf consistency and compleyity. Consistency is 
defined as 80% or better in running text arKf complexity is dicotosmzed as- 
oncrto-one correspondence-simple; all others -complex. It is proposed that 
only those oeneralizations in the upper left q^iadrant are useful... However 
as indicated earlier, by cxamininq frequencies of sub-elements some of 
the rules, new rules, less complex and more consistent in nature, are 
likely tp be found. In this way, some of these rules iti other cfuadrants 
can be moved to the upper left t^uadrant. 
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